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A METHOD FOR THE PRODUCTION OF HIV-1 
GAG VIRUS-LIKE PARTICLES 



FIELD OF THE INVENTION 

This invention relates to a method for the production of HIV-1 Gag virus-like particles, to 
the virus-like particles prepared by the method, and to the use of the virus-like particles 
in a vaccine. 

BACKGROUND OF THE INVENTION 

The HIV genome contains three main open reading frames: The gag open reading 
frame (Fig. 1) encodes a 55 kDa precursor protein (Pr55 Gag ) which is cleaved further by 
an HIV-encoded protease during virion maturation into three major structural proteins, a 
regulatory domain and 2 spacer peptides (Luciw, 1996). The structural proteins include 
the matrix (MA) protein (P1 7 — AA1 to AA1 32), the capsid (CA) protein (P24 - AA1 33 to 
AA363) and the nucleocapsid (NC) protein (P9 - AA377 to AA432). The regulatory 
domain (P6) spans AA449 to AA500 while the spacer regions P1 and P2 are located 
from AA433 to AA448 and AA364 to AA376 respectively (von Schwedler et a/., 1 998). 

The pol open reading frame overlaps that of gag from AA430 and is expressed' via a 
ribosomal frame-shifting event that occurs at a frequency of 5 to 10% during translation 
to produce a Gag-Pol precursor protein of 160kDa (Pr160) (Jacks et at., 1988). The pol 
gene encodes several open reading frames including that for the protease, reverse 
transcriptase, RNase H and integrase enzymes of HIV-1 . The env open reading frame 
lies further downstream of pol and encodes a 160 kDa precursor protein (gp160) of the 
viral envelope proteins gp41 and gp120 (Luciw, 1996). 

After infection of a host cell, HIV-1 RNA is reverse transcribed into DNA which is 
subsequently integrated into the host genome (proviral stage)/ The Gag and Gag-Pol 
precursors are translated from transcribed HIV-1 provirus RNA in the cytosol and 
targeted to the host cell membrane. The Gag precursor associates with two copies of 
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viral RNA and interacts with the Gag-Pol precursor to assemble into particle-like 
structures which line the host-cell membrane. They aggregate in such a way as to 
induce membrane curvature and subsequent bud formation during which viral Env 
proteins are also incorporated into the forming particles. The particles pinch off the 
membrane after which the HIV-1 particle maturation occurs, with the protease cleaving 
Gag and Gag-Pol into mature structural and functional proteins which lead to core 
condensation and thus a mature infectious virion. 

Pr55 Gag has, been shown to assemble into virus-like particles (VLPs) in the absence of 
any other HIV-encoded genes in both mammalian and insect ceils. These particles 
closely resemble the morphology of immature HIV virions and are non-infectious 
(Overton et a/., 1989; Gheysen et a/., 1989; Royer et a/., 1991; Roye'r et a/., 1992; 
Shioda and Shibuta, 1990; Vernon et a/. , 1991; Mergener et a/., 1992). A number of 
Gag domains have been shown to be important in driving this particle assembly process 
and it has been shown that in fact about 80% of this precursor protein can be either 
deleted or replaced by heterologous sequences without significantly compromising VLP 
production (Accola et a/, 2000). These important domains are discussed below with 
respect to the functions of the individual proteins comprising Gag. 

MA protein (d17) 

The MA domain of Gag (Figure 2) comprises a total of 132 amino acids and is 
responsible for targeting Gag precursor protein to the plasma membrane and virus-like 
particle assembly. The M domain (retrovirus membrane-binding domain) at the N- 
terminal of MA is mostly responsible for this function. MA has an N-terminal glycine 
residue which has been shown to be required for targeting Gag to the host cell 
membrane and facilitating particle assembly (Gheysen et a/., 1989). For this to occur, 
the glycine residue has to be myristylated. The amino acid recognition sequence for 
myristylation to occur at the N-terminus of Gag is gly-x-x-x-ser/thr. 

The targeting and accumulation of HIV-1 Gag precursor at the host cell membrane by 
myristylation has been shown to occur in baculovirus-infected yeast cells, insect cells 
and mammalian cells (Jacobs et a/., 1989; Gheysen et a/, 1989; Bryant and Ratner, 
1990). Substitution of the glycine residue eradicated particle formation, 
complementation of the residue restored VLP production and when using myr" mutants, 
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Gag precursor was shown to accumulate in infected cell cytoplasm but did not associate 
with the host cell membrane. The myristyl moiety is thus required for stable membrane 
association of the particles. Only complete inhibition of Gag myristylation prevents VLP 
budding (Morikawa et al., 1996), i.e. only a few myristylated Gag molecules are sufficient 
5 for plasma membrane targeting and budding. 

Spearman (1997) has shown that myristate is the primary determinant of Gag-host cell 
membrane stability and therefore critical for particle assembly. Paillart and GSttlinger 
(1999) have proposed a model from results showing that the N-terminus of Gag is critical 
10 for insertion into the host cell membrane. 

Apart from the myristylation signal other regions of MA have been shown to be important 
for targeting Gag to the host cell membrane and subsequent particle assembly. Facke 
et al. (1993) showed that a large deletion of MA (AA16 to 99) caused drastic alteration of - 
particle morphogenesis leading to immature particles produced in the endoplasmic 
reticulum instead. MA is required for the proper assembly of envelope proteins into the 
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virion. 



Yuan et al. (1993) showed that various MA deletions and substitutions caused a 
20 dramatic reduction in virus particle production. They demonstrated that it is possible that 
a polybasic region in MA (AA20 to 32) serves as part of a Gag transport signal to the 
membrane. 



Zhou et al. (1994) studied this polybasic region further and showed that the highly basic 
residues form a positively charged surface which interacts with negatively charged 
phospholipids on the inner face of the lipid bilayer of the plasma. Ono and Freed (1999) 
have shown that a single mutation of MA (AA6 from V to R) severely impaired 
membrane binding without affecting myristylation. 

30 Capsid (CA) protein (n24) 

The CA domain (Figure 3) encodes a protein of approximately 230 amino acids in length 
and has several domains which appear to be important for particle assembly, the first of 
which is a major homology region (MHR). The region extending from the- N-terminus of 
CA downstream to the MHR is dispensible for particle formation, but any further 
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deletions extending further into the MHR impair particle production (Borsetti et a/., 1998). 
Zhao et al., (1994) also showed that baculovirus constructs of HIV-1 CA with a 10-amino 
acid deletion of AA140-150 as well as a separate deletion of AA250-260 led to the 
accumulation of viral protein at the cell membrane of insect cells. However there was no 
particle assembly or extracellular budding indicating that these two regions of CA at 
least, must play some role in normal particle formation. 

Oh the other hand, Borsetti et al. (1998) showed that efficient partfde formation occurred 
in the absence of both MA (excluding the myristyl anchor) and the N-terminal of the CA 
domain and therefore concluded that there is no distinct region between the myristyl 
anchor and MHR which is absolutely essential for efficient particle release or assembly. 
They concluded that the C-terminal half of Gag contains protein-protein interaction 
domains which are essential for efficient particle assembly. 

15 It seems that the C-terminal sequences may be required for protein-protein interactions 
but are not required for spherical particle formation and that the sphere is determined by 
the presence of an N-terminal extension on the CA domain. 

The presence of RNA (heterogeneous in size and of viral and cellular origins) within the 
20 particles has been reported (Gheysen and Shioda and Shibuta) 

Truncations into the P2 spacer regions have been shown to abolish particle formation. 



25 



Spacer region 2 (p2^ 

Borsetti et al., 1998 have shown that the presence or absence of p2 determines the 
assembly of Gag proteins into spherical particles or cylindrical particles respectively. 
Moakawa et at. (2000) have also verified that this region is essential for VLP production 
in that if this region is truncated in any way, VLP production is abolished. 

•iO ffrudsocaos id (NO nrotftin f P 7) 

r*e *C domain (Figure 4) has been shown to contain two well-conserved Cys-His boxes 
resembling zinc finger motifs often found in DNA binding proteins. These are thought to 
play a role in RNA binding and encapsidation but influence some other aspects of 
particle assembly as well. There are two highly basic regions flanking these two motifs 
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which have been shown to influence RNA binding in vitro and RNA encapsidation into 
virions if mutated. Jowett et al. (1992) showed that the deletion of the second Cys-His 
box did not affect particle formation but reduced RNA binding substantially. However, 
they also showed that deletion of both Cys-His boxes encouraged the formation of larger 
particles and the loss of RNA binding altogether. The deletion of sequences upstream of 
the Cys-His boxes caused the abolition of particle-forming ability. 

Dawson andYu (1998) showed that the NC domain is essential for efficient assembly of 
HIV-1 and for the production of particles with wildtype density. 



In addition to the Cys-His boxes, an I domain (interaction or assembly domain) close to 
the N-terminal of NC has been identified which is responsible for the formation of Gag 
protein complexes and also for the formation of punctate foci of Gag proteins at the 
plasma membrane. There are two positively charged basic arginine residues (AA380 
15 and AA384) which have been shown to be critical for the function of the N-terminal I 
domain (interaction or assembly domain). Sandefur et al. (2000) have shown that I 
domain-deficient mutants block the formation of budding VLPs. 

Spacer region 1 (p1) 

20 CA and NC are separated by a short spacer region SP1 which is a protease cleavage 
site. Wiegers et al. (1998) have shown that when cleavage of SP1 from NC is 
prevented, maturation of particles is delayed and the ribonucleoprotein core has an 
irregular morphology. However, when SP1 cleavage from CA is prevented, normal 
condensation of the ribonucleoprotein cone occurs but capsid condensation is 

25 prevented. They concluded from this that HIV maturation is a sequential process 
controlled by the rate of cleavage at individual sites. 

P6 protein 

Elements important for controlling particle size are contained within the C-terminal region- 
30 of gag (P6) (Figure 5) as various deletions and substitutions of this region have been 
shown to induce the formation of very large particles (Gamier et al., 1998). A specific 
domain referred to as the late (L) domain has been identified in P6 that is critical for the 
virus-cell separation step. This region contains a PTAPP amino acid sequence. 
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Sequences downstream of this domain in P6 were shown to be dispensable for virus 
release. 

Other features of Gaa infl uencing VLP formation 

Buck et al. (2001 ) have recently found that mRNA of the HIV-1 Gag open reading frame 
exhibits internal ribosome entry site (IRES) activity that promotes translational initiation 
of Gag, producing a 40 kDa Gag protein. This IRES is located at an internal AUG codon 
found near the N-terminus of CA. This may have consequences when Gag is expressed 
in in vitro systems since initiation of translation is thought to be promoted by the direct 
binding of ribosomes with the participation of other host cell factors quite independently 
of the mRNA cap. 

Minimal HIV-1 g a g seq ue nce s required for VLP assembly and release 
Although numerous domains of Gag have been shown to perform a particular function in 
Gag VLP assembly and release, deletion mutants of Gag have indicated that a number 
of these can be dispensable, and assembly of VLPs has been shown to be surprisingly 
tolerant to significant modifications of Gag protein (Wang et al., 1998; Accola et al 
. 2000; WBk et al., 2001). Wang et al., (1998) made a series of C-terminal truncated 
mutants with which to examine VLP-producing capabilities. Truncated Gag precursors 
lacking most of the C-terminal Gag assembled into particles and were released from 
mammalian cells. A mutant with most of MA and the entire p6 region deleted still 
produced particles although less than wildtype particles. The smallest Gag product 
capable of VLP assembly was a 28 kDa protein which consisted of a few MA amino 
acids and the CA-p2 domain. The N-terminal portions of CA appeared to be critical 
when most of the MA domain was deleted, suggesting a requirement for an intact CA 
domain to assemble and release particles. Accola et al. (2000) showed that 80% of Gag 
could be deleted or replaced by heterologous sequences without significantly 
compromising VLP production. The smallest chimeric molecule still able to efficiently 
form VLPs was 16 kDa. this construct contained a leucine zipper domain of the yeast 
transcription factor GCN4 to substitute for the assembly function of nucleocapsid, 
fcStowed by a PPPPY motif to provide the L domain function, and retained only the 
myristylation signal and the C-terminal CA-p2 domain of Gag. 
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Ability of Gag to stimulate an immune response 

The long-term solution to combating the AIDS epidemic is through immunisation with a 
suitable vaccine and there are numerous HIV-1 genes and epitopes which are currently 
being used to develop vaccines. Although there is no Gag vaccine in use to date, a 
5 number of studies have shown that this protein induces an immune response in HIV-1 - 
infected patients as well as in animals challenged with novel vaccines (Friedman ef a/., 
2000; Revskaya and Frankel, 2001).; ).This response has been shown in some cases to 
be both humoral and cell-mediated (Qui et al., 1999; Leung et al., 2000; Qui et a/., 
2000; Montefiori era/., 2001; Kazanji etal., 2001). 

10 

Goulder et al. (2000) did studies to identify the epitopes, which dominate the CTL 
response in ethnic groups and age groups worst hit by the global epidemic. They 
focussed on Gag because a number of gag specific responses have been shown to be 
associated with protection in HIV infection (Nixon and McMichael, 1991; Riviere et al., 
15 1995). The immunodominant Gag-specific CTL responses appeared to be focussed on 
3 highly immunogenic regions, which together spanned 16% of the total length of Gag 
p17 and Gag p24 proteins but which represent two thirds of the dominant Gag-specific 
CTL responses detected. These results suggest that Gag would be an ideal candidate 
for vaccine design. 

20 

Plants as sources of vaccines 

There are many examples of the use of plants as sources of foreign protein and they are 
considered as viable and competitive expression systems for large-scale protein 
production (Doran, 2000). Favourable reasons for their use include the potential for 
25 large-scale, low-cost biomass production, a low risk of contamination by mammalian 
viruses and other animal pathogens, the ability of plant cells to correctly fold and 
assemble multimeric proteins, and a low processing requirement for proteins 
administered orally in plant food or feed. They are thus considered a viable option for 
the production of foreign proteins, which can be used as vaccines. 

30 

A number of potentially useful vaccine candidates have been produced in plants and 
tested in animals. A useful technique for introducing foreign genes into plants has been 
via viral vector transmission. 
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Usha et al., (1993) have used cowpea mosaic virus (CPMV) particles to express 
epitopes of foot and mouth disease virus (FMDV) on their coat protein as a result of a 
fusion on the coat protein gene. 

Koo et al. (1999) made hybrids of tobacco mosaic virus (TMV) by fusing short epitopes 
from murine hepatitis virus (MHV) to TMV coat protein and subsequently propagating 
them in tobacco plants. 

Fernandez-Fernandez et al. (2001 ) have developed a plum pox potyvirus vector for the 
expression of foreign proteins. They have used it to express an antigenic structural 
protein of rabbit hemorrhagic disease virus (RHDV), producing chimeric virus particles 
which when inoculated into rabbits produce an immune response against RHDV. 

A number of other methods have been used to produce vaccine candidates in plants. 
McCormick et al. (1999) demonstrate the modification of a TMV vector such that it not 
only produces single chain Fv fragments in plants, but secretes them into the apoplast. 
This makes harvesting of the product a lot simpler than having to isolate foreign proteins 
from leaf extracts. Several attempts have also been made to make plants transgenic for 
production of foreign proteins to be used as vaccine candidates. Mason et al. (1996) 
have made transgenic tobacco and potato plants to successfully produce Norwalk virus 
capsid protein, which shows immunogenicity in mice. Wigdorowitz et al. (1999) have 
made transgenic alfalfa plants expressing an antigenic protein against FMDV and shown 
that animals immunised with purified antigen show immunogenicity against the virus. 

There have been a few attempts at producing candidate vaccines against HIV-1 in 
plants. Yusibov et al. (1997) have used the coat protein of alfalfa mosaic virus as a 
carrier molecule to express antigenic peptides from HIV-1 (V3 loop). In vitro transcripts 
of recombinant virus with sequences encoding the antigenic peptides were synthesised 
from DNA constructs and used to inoculate tobacco plants. Recombinant virus particles 
were produced and purified and used for immunisation of mice. The antigens elicited 
specific virus-neutralizing antibodies in immunised mice. Zhang et al. (2000) used a 
tomato bushy stunt virus (TBSV) as an expression vector of HIV-1 p24 protein. This 
gene was introduced into the TBSV genome as an in-frame fusion with a 5" terminal 
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portion of the TBSV coat protein ORF. Introduction into plants led to the accumulation of 
p24 fusion proteins in inoculated leaves. 

There has, to date, been no report in the literature about the ability of GAG VLPs to 
assemble correctly in plants. Although some proteins, and even some HIV proteins, 
have been shown to assemble into VLPs in plants and insects, many other proteins do 
not assemble at all, or do not assemble correctly, in plants or insects, and mammalian 
studies are not reliable indicators of whether VLPs will similarly be produced in plants or 
insects. 



SUMMARY OF THE INVENTION 

According to a first embodiment of the invention, there is provided a vector including a 
nucleotide sequence encoding an HIV Gag polypeptide, wherein the nucleotide 
sequence encoding the Gag polypeptide comprises a sequence having at least 90% 
sequence identity to the sequence set forth in SEQ ID NO: 1 . 



According to a second embodiment of the invention, there is provided a vector including 
a nucleotide sequence encoding an HIV Gag polypeptide, wherein the nucleotide 
sequence encoding the Gag polypeptide comprises a sequence having at least 90% 
homology to the sequence set forth in SEQ ID NO: 2. 



The vector of either embodiment may be a plant virus vector, for example, a tobacco 
25 mosaic virus-derived cDNA cloned vector such as the pBSG1057 vector, or a potyvirus- 
derived cDNA such as tobacco etch virus (TEV) or turnip mosaic virus (TuMV). The 
vector may also be an Agrobacterium tumefaciens containing a T-derived plasmid 
construct. 



Alternatively, the vector may be a baculovirus vector, such as the bacmid vector. 

According to a third embodiment of the invention, there is provided a cell including a 
vector substantially as described above, wherein the nucleotide sequence is operably 
linked to control elements compatible with expression in the cell. 
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The cell may be a plant or insect cell. For example, the cell may be an N. benthamiana 
plant cell or an Sf 21 , Sf 9 or the like cell. 

According to a fourth embodiment of the invention, there is provided a method of 
producing an HIV-1 immunogenic protein or a related polypeptide, the method 
comprising the steps of: 

introducing a vector or vector system into a host cell, the vector or vector system 
including a nucleic acid sequence encoding the HIV-1 immunogenic protein or related 
polypeptide derived by substitution, deletion and/or insertion of one or more nucleotides, 
and/or extension or truncation of one or both ends thereof, the nucleic acid sequence 
having at least 90% identity to the sequence set forth in SEQ ID NO:1 ; 

causing expression of the nucleic acid sequence in the host cell; and 
recovering the resulting HIV-1 immunogenic protein or related polypeptide 
produced within the host cell. 

According to a fifth embodiment of the invention, there is provided a method of 
producing an HIV-1 immunogenic protein or a related polypeptide, the method 
comprising the steps of: 

introducing a vector or vector system into a host cell, the vector or vector system 
including a nucleic acid sequence encoding the HIV-1 immunogenic protein or related 
polypeptide derived by substitution, deletion and/or insertion of one or more nucleotides, 
and/or extension or truncation of one or both ends thereof, the nucleic acid sequence 
having at least 90% identity to the sequence set forth in SEQ ID NO:2; 

causing expression of the nucleic acid sequence in the host cell; and 
recovering the resulting HIV-1 immunogenic protein or related polypeptide 
produced within the host.cell. 

The vector and host cell may be substantially as described above. 

According to a further embodiment of the invention, there is provided an HIV-1 protein or 
polypeptide that is produced according to the method substantially as described above. 

The protein may be an HIV-1 Pr55 Gag protein, and may be assembled into the form of 
virus-like particles (VLPs). 
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According to a further embodiment of the invention, there is provided a vaccine for use in 
the treatment or prophylaxis of HIV infection in a mammal, the vaccine including virus- 
like particles of proteins or polypeptides substantially as described above. 

The vaccine may induce an immunogenic response to the virus-like particles in a 
suitable susceptible host. 

The vaccine may include a pharmaceutical excipient and/or adjuvant, and a 
therapeutically effective amount of the virus-like particles. 

DESCRIPTION OF THE DRAWINGS 



Figure 1 
Figure 2 
Figure 3 
Figure 4 
Figure 5 
Figure 6 



Figure 7 
Figure 8 
Figure 9 
Figure 10 

Figure 11 



shows a schematic representation of the gag open reading frame of the 
HIV-1 genome; 

shows a schematic representation of the matrix (MA) protein (p24) 
domain of the gag gene of Figure 1 ; 

shows a schematic representation of the capsid (CA) protein (p24) 
domain of the gag gene of Figure 1 ; 

shows a schematic representation of the nucleocapsid (NC) protein (p7) 
domain of the gag gene of Figure 1 ; 

shows a schematic representation of the domain of the p6 protein domain 
of the gag gene of Figure 1 ; 

shows the DNA sequence of the Du422 gag sequence used for cloning 
into pBSG1057 (SEQ ID NO: 1). The bold underlining represents the gag 
gene and the dotted underlining represents the partial pol fragment; 
shows a plasmid map of pBSG1057; 
shows a plasmid map of pBSGgag6; 
shows a plasmid map of pBSGgagoptl 1 ; 

shows the DNA sequence of the native Du422 gag sequence (SEQ ID 
NO: 2); 

shows HIV-1 subtype C Pr55 Gag VLPs resulting from gagopt expression 
immunotrapped onto carbon-coated grids using anti-p17 monoclonal 
antibody (Chemicon) (the bar represents 100nm); 
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Figure 12 shows (a) HiV-1 subtype C Gag VLPs produced in transfected Sf21 cells 
and (b) Gag VLPs budding into the extracellular medium from Sf21 
plasma membrane (the bar represents 100 nm); 

Figure 13 shows the amino acid sequence of the nucleotide sequence of Figure 6 
(SEQ ID NO: 3); and 

Figure 14 shows the amino acid sequence of the nucleotide sequence of Figure 10 
(SEQ ID NO: 4). 

DETAILED DESCRIPTION OF AN EMBODIMENT OF THE INVENTION 

The invention will now be described in more detail with reference to particular 
embodiments of the invention. 

Cloning of Gag into TMV vector PBSG1057 

Transient expression in tobacco was achieved using the vector plasmid pBSG1057 from 
Large Scale Biology Corporation. When in vitro transcribed into RNA, this vector 
provides an infectious engineered tobacco mosaic virus which expresses native or plant 
codon optimised Du422 Pr 55 Gag. 

The Gag gene was obtained from HIV-1 isolate DU422 (obtained from a South African 
sex worker cohort, and assigned provisional accession no. 01032114 by the European 
Collection of Cell Cultures) (Figure 6). It comprises the entire gag gene sequence and 
the first 57 bases of the pol gene sequence (SEQ ID NO: 2). It was cloned into the 
EcoRI and Sail restriction enzyme sites of an E. coli vector pGEM-T easy™. The ends 
of the gene were modified by PCR amplification such that Pad and Xho\ restriction 
enzyme sites were attached to the 5* and 3' ends respectively, to facilitate cloning into 
the TMV vector pBSG1057 (Figure 7). Amplification products were re-cloned into 
pGEM-T easy™ and sequenced to verify the integrity of the restriction enzyme sites and 
the gag sequence. 

The green fluorescent protein (GFP) gene sequence was excised from pBSG1057 by 
restriction enzyme digestion with Pad and Xho\, and gag cloned into the TMV vedor at 
these 2 sites to produce the clone pBSGgag6 (Figure 8). 
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It is theorised that the identical gag construct could be cloned in frame into a potyvirus-derived 
cDNA clone, flanked by appropriate endoproteinase recognition sequences derived from a 
potyvirus proteome, for expression via in v/fro-transcribed infectious recombinant potyviral RNA. 
The same would be true for vectors derived from any plant virus. 

It is also theorized that large-scale transient expression in tobacco could be performed by means 
of infiltrating tobacco leaves with an Agrobacterium suspension. Using this method, many cells 
could be transformed and the protein could be produced somatically rather than first having to 
produce callus and then transgenic plants (Kapila et al., 1997). The vector may be 
Agrobacterium tumefaciens containing a T-derived plasmid construct. 

A tobacco etch virus, such as those available from Large Scale Biology Corporation, could also 
be used in the invention. 

Codon-optimisation of gag gene for Nicotians 

An additional strategy was undertaken to codon-optimise the gag gene for Nicotians, synthesise 
it, and clone it using a similar strategy to that described above into the pBSG1057 TMV vector 
with the hope that this gene would enhance Gag protein expression when introduced into N. 
benthamiana. The codon-optimised gag gene sequence (SEQ ID NO: 1) was cloned into the 
Pad and Xho\ restriction enzyme sites of pBSG1057 after removal of GFP. The resultant clone 
was called pBSGgagoptH (Figure 9). 

Transcription of pBSGqaa6. pBSj 3gagopt11 and PBSG1057 

mRNA of pBSGgag6, pBSGgagoptH and pBSG1057 was produced using a Ribomax 
Transcription/translation kit (Promega). Ten micrograms of each plasmid was used per reaction. 

Infection of Nicotian a benthamiana with recombinant TMV mRNA 

N. benthamiana plants were inoculated with mRNA transcripts of the pBSGgag6 and 
pBSGgagoptH clones as well as with the TMV vector containing GFP (pBSG1057) as described 
previously. Water-inoculated plants served as negative controls. 

The mRNA (50uJ) was rubbed over an expanding leaf of 6-week old N. benthamiana 
plants using cotton-wool buds. Plants were grown under normal growth conditions in 
plant rooms and monitored daily with a UV light for the appearance of green fluorescent 
spots (GFP) in both inoculated and upper leaves of the control plants inoculated with 
PBSG1057 mRNA transcripts, as well as for TMV symptoms in pBSGgag6-, 
pBSGgagoptH- and pBSGl057-inoculated plants. Systemic spread of GFP was used 
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as an indicator of systemic spread of recombinant TMV and leaves were sampled for 
detection of Gag protein by western blotting, EM analysis and ELISA. 

TMV symptoms and GFP fluorescence: 

Green fluorescent spots were visible under the UV light on the inoculated leaves of 
those infected with pBSG1057 mRNA transcript at 4 days post inoculation (dpi). Spread 
of the GFP spots to upper leaves was visible at 10 dpi. TMV symptoms were visible in 
the newer growth of pBSG1057-inoculated plants at 17 dpi and in the pBSGgag6- and 
pBSGgagoptl 1 -inoculated plants at 24 dpi. 

Detection of Gag protein 

Crude protein preparations were made by crushing up leaves using a mortar and pestle, 
filtration of remaining solid matter through cheesecloth, and addition of loading buffer. 

Western blotting 

The samples were boiled for 5 minutes and run on 10% SDS polyacrylamide gels to 
separate the proteins. Resolved proteins were transblotted from the SDS 
polyacrylamide gels onto nitrocellulose membranes. Membranes were probed with an 
anti-mouse p17 monoclonal antibody (Chemicon) at a dilution of 1 :200. 

Lanes containing crude protein preparations from pBSGgag6- and pBSGgagoptl 1- 
inoculated leaves did not yield any positive result compared with baculovirus-produced 
gag (see below) which highlighted the presence of a 55kD protein. 

EM analysi s for Gao VLPs nmmunotrapoina) 

Crude apical leaf extracts (35 dpi) were ground up in PBS (pH 7.4) and centrifuged at 
low speed to pellet remaining unground tissue. Small amounts of leaf extract were dried 
on copper grids and immunotrapped with an anti-mouse p17 monoclonal antibody 
(Chemicon) at a dilution of 1:200. The grids were counterstained with uranyl acetate and 
viewed using a transmission electron microscope. 

There was very little plant material present in the pBSG1057-inoculated sample (no 
particles ranging from 100 to 120 nm in size) and very few TMV particles. 
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Apparent virus-like particles which were morphologically identical to baculovirus- 
expressed Gag Pr55 VLPs (Fig 12) were detected in concentrated extracts made from 
N. benthamiana plants (Figure 11). In the pBSGgag6-lnoculated sample there were a 
large number of 100 to 110 nm-sized VLPs as well as quite a few TMV subunit-sized 
particles (25 nm). In the pBSGgagoptl 1-inoculated sample, there were similar shaped 
but less 100 to 110 nm-sized particles to those seen in the pBSGgag6-inoculated 
sample. 

ELISA 

Crude leaf extracts were centrifuged at low speed to pellet remaining unground tissue. 
Small amounts were diluted in 1 X PBS and used for detection of p24 antigen by using 
the Vironostika® p24 HIV-1 antigen ELISA kit. 

Preliminary results of ELISA performed on crude preparations of protein showed the 
presence of gag protein in pBSGgagoptl 1-inoculated leaves at a level of 460 pg/ml 
crude preparation protein solution. Expression of the gagopt gene in N. benthamiana 
plants systemically infected with the appropriate vectors was significantly higher than 
native gag gene. 

Baculovir us expression of HIV-1 subtype C gag VLPs 

HIV-1 subtype C Gag VLPs were produced using the Bac-to-Bac® baculovirus 
expression system (Life Technologies). These provide a relevant positive control for 
further protein (Gag) detection experiments and can be used to generate antibodies 
specific to HIV-1 subtype C Gag. 

The HIV-1 subtype C gag gene from the South African HIV isolate Du422 (Williamson et 
al., 2003) (SEQ ID NO: 2) was cloned into the multiple clpning site pFastBad, and 
transposed into competent E. coli DHIOBac cells which were then screened for 
successful transposition into the baculovirus shuttle vector (bacmid). 

Gag VLPs were produced in Spodoptera frugiperda (Sf21) cells via recombinant 
baculovirus expressing the full-length myristylated Pr55Gag precursor protein, according 
to the manufacturer's protocols (Gibco Life Sciences). The cells were incubated in 
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TC100 medium (Gibco Life Sciences) supplemented with foetal calf serum at 28°C for 
84 h. 

Because the cloned gag gene contains an N-terminal glycine residue sequence, it is 
expected that the recombinant protein be targeted to the host plasma membrane. VLPs 
thus formed were subsequently budded from the cell surface into the insect cell medium. 
Transfected Sf21 ceils were separated from VLPs which had budded into the culture 
medium by centrifugatidn at 3000 g. Putative Pr55Gag VLPs were purified from the 
culture fluid on sucrose gradients as described by Nermut et al. (1994). Purified VLPs 
were dialysed for 16 h in 1 x phosphate-buffered saline (PBS) at 4°C and Gag content 
and integrity was evaluated by western blotting using antiserum to HIV-1 p17 (ARP431, 
NIBSC) diluted 1 in 1000 in 1 x PBS (pH 7.4) after SDS-PAGE on 10% gels. 

The process of VLP production by Sf21 cells was visualised by transmission electron 
microscopy (TEM). Recombinant virus-infected cells were prepared for ultrathin 
sectioning by fixing cells sequentially in 2.5% glutaraldehyde and 1% osmium tetroxide 
in 1 x PBS (pH 7.4). . Fixed cells were washed in 1 x PBS and water, and then 
dehydrated in graded ethanol solutions and 100% acetone, after which they were 
embedded in Spurrs resin and sectioned. Sections were stained with both 2% uranyl 
acetate and Reynolds lead citrate and viewed using a Zeiss S1109 electron microscope 
at magnifications of 12000x to 100 000' using an accelerating voltage of 80 kV. 

Gag VLPs harvested from the extracellular medium were prepared for TEM by 
adsorption onto carbon-coated copper grids and staining with 2% uranyl acetate or 2% 
methylamine tungstate (Figure 12). 

VLPs of approximately 110 to 120 nm in diameter were visualized under the electron 
microscope, verifying successful gag VLP production. A single 55kD protein band was 
visualized in samples resolved on an SDS page gel, and a monoclonal and a polyclonal 
srStocGy to gag P17 protein were found to react positively using western blot analysis. 
Two additional monoclonal antibodies to gag P24 protein were tested subsequently 
against baculovirus-derived VLPs and reacted positively using western blot analysis. 
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Vaccine development 

It is envisaged that the immunogenic VLPs produced in the plant and insect cells, as 
described above, will be used in the manufacture of a vaccine for use in the treatment or 
prophylaxis of HIV infection in a mammal. The vaccine would be expected to induce an 
immunogenic response to the virus-like particles in the mammal. In addition to a 
therapeutically effective amount of the virus-like particles, the vaccine could include a 
pharmaceutical excipient and/or adjuvant. 

Although the invention has been described above with reference to particular 
embodiments, it is not intended that this should limit the invention to what has been 
described above. 
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CLAIMS 

1. A vector including a nucleotide sequence encoding an HIV Gag polypeptide, 
wherein the nucleotide sequence encoding the Gag polypeptide comprises a 
sequence having at least 90% sequence identity to the sequence set forth in SEQ 
IDNO:1. 

2. A vector including a nucleotide sequence encoding an HIV Gag polypeptide, 
wherein the nucleotide sequence encoding the Gag polypeptide comprises a 
sequence having at least 90% homology to the sequence set forth in SEQ ID NO: 
2. 

3. A vector according to either of claims 1 or 2, which is a plant vector. 

4. A vector according to claim 3, which is a tobacco mosaic virus vector. 

5. A vector according to claim 3, which is an Agrobacterium tumefaciens containing a 
T-derived plasmid construct. 

. • 6. A vector according to either one of claims 1 or 2, which is a baculovirus vector. 

7. A cell including a vector according to any one of claims 1 to 6, wherein the 
nucleotide sequence is operably linked to control elements compatible with 
expression in the cell. 

8. A cell according to claim 7, which is a plant cell. 

.9. A cell according to claim 8, which is a N. benthamiana plant cell. 

10. A cell according to claim 7, which is an insect cell. 

11. A cell according to claim 1 0, which is an Sf 21 or Sf 9 ceil. 
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12. A method of producing an HIV-1 immunogenic protein or a related polypeptide, the 
method comprising the steps of: 

(a) introducing a vector or vector system into a host cell, the vector or vector 
system including a nucleic acid sequence encoding the HIV-1 immunogenic protein 
or related polypeptide derived by substitution, deletion and/or insertion of one or 
more nucleotides, and/or extension or truncation of one or both ends thereof, the 
nucleic acid sequence having at least 90% identity to the sequence set forth in 
SEQID NO:1; 

(b) causing expression of the nucleic acid sequence in the host cell; and 

(b). recovering the resulting HIV-1 immunogenic protein or related polypeptide 
produced within the host cell. 

13. A method of producing an HIV-1 immunogenic protein or a related polypeptide, the 
method comprising the steps of: 

(a) introducing a vector or vector system into a host cell, the vector or vector 
system including a nucleic acid sequence encoding the HIV-1 immunogenic protein 
or related polypeptide derived by substitution, deletion and/or insertion of one or 
more nucleotides, and/or extension or truncation of one or both ends thereof, the 
nucleic acid sequence having at least 90% identity to the sequence set forth in 
SEQIDNO:2; 

(b) causing expression of the nucleic acid sequence in the host cell; and 

(b) recovering the resulting HIV-1 immunogenic protein or related polypeptide 
produced within the host cell. 

14. A method according to either one of claims 12 or 13, wherein the vector is a plant 
vector. 

15. A method according to claim 14, wherein the vector is a tobacco mosaic virus 
vector. 

16. A method according to claim 14, wherein the vector is an Agrobacterium 
tumefaciens containing a T-derived plasmid construct. 



23 



WO 2004/050691 



PCT7IB2003/005634 



17. A method according to either one of claims 12 or 13, wherein the vector is a 
baculovirus vector. 

18. A method according to any one of claims 12 to 16, wherein the host cell is a plant 
cell. 

19. A method according to claim 18, wherein the plant cell is a N. benthamiana plant 
cell. 

20. A method according to any one of claims 12, 13 and 17, wherein the host cell is an 
insect cell. 

21 . A method according to claim 20, wherein the insect cell is an Sf 21 or Sf 9 cell. 

22. An HIV-1 protein or polypeptide that is produced according to the method of any 
one of claims to 12 to 21. 

23. A protein or polypeptide according to claim 22, which is an HIV-1 Pr55 Gag protein. 

24. A protein or polypeptide according to either of claims 22 or 23, which is assembled 
into the form of virus-like particles (VLPs). 

25. A vaccine for use in the treatment or prophylaxis of HIV infection in a mammal, the 
vaccine including virus-like particles of proteins or polypeptides as described in any 
one of claims 22 to 24. 

26. A vaccine according to claim 25, which induces an immunogenic response to the 
virus-like particles in a suitable susceptible host. 

27. A vaccine according to either one of claims 25 or 26, which includes a 
pharmaceutical excipient and/or adjuvant, and a therapeutically effective amount of 
the virus-like particles. 
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1 ATGGGTGCGA GAGCGTCAAT ATTAAGAGGG GAAAAATTAG ATAAATGGGA AAAGATTAGG 

61 TTAAGGCCAG GGGGAAAGAA ACATTATATG TTAAAACACA TAGTATGGGC GAGCAGGGAG 

121 CTGGAAAGAT TTGCACTTAA CCCTGGCCTT TTAGAAACAT CAGAAGGATG TAAACAAATA 

181 ATGAAACAGC TACAACCAGC TCTCCAGACA GGAACAGAGG AACTTAAATC ATTATACAAC 

241 ACAGTAGCAA CTCTCTATTG TGTACATGAA AAGATAGAAG TACGAGACAC CAAGGAAGCC 

301 TTAGATAAGA TAGAGGAAGA ACAAAACAAA TGTCAGCAAA AAACGCAGCA GGCAAAAGCG 

361 GCTGACGGGA AAGTCAGTCA AAATTATCCT ATAGTGCAGA ATCTCCAAGG GCAAATGGTA 

421 CATCAAGCCA TATCACCTAG AACCTTGAAT GCATGGGTAA AAGTAATAGA AGAAAAGGCT 

481 TTTAGCCCAG AGGTAATACC CATGTTTACA GCATTATCAG AAGGAGCCAC CCCACAAGAT 

-4. TTAAACACCA TGT TAAATAC AGTGGGGGGA CACCAAGCAG CCATGCAAAT GTTAAAAGAT 

601 ACTATTAATG AAGAGGCTGC AGAATGGGAT AGATTACATC CAGTCCATGC GGGGCCTATT 

661 GCACCAGGCC AGATGAGAGA ACCAAGGGGA AGTGACATAG CAGGAACTAC TAGTACCCTT 

721 CAGGAACAAA TAGCATGGAT GACAAGTAAC CCACCTATTC CAGTGGGAGA CATCTATAAA 

781 AGATGGATAA TTCTGGGGTT AAATAAAATA GTGAGAATGT ATAGCCCGGT CAGCATTTTG 

841 GACATAAGAC AAGGGCCAAA GGAACCCTTT CGAGACTATG TAGATCGGTT CTTTAAAACT 

901 TTAAGAGCTG AACAAGCTAC ACAAGAAGTA AAAAATTGGA TGACAGACAC CTTGTTAGTC 

961 CAAAATGCGA ACCCAGATTG TAAGACCATT TTGAGAGCAT TAGGACCAGG GGCTACATTA 

xO^.1 GAAGAAATGA TGACAGCATG TCAAGGGGTG GGAGGACCTG GCCACAAAGC AAGAGTATTG 

1081 GCTGAGGCAA TGAGTCAAAC AAACAGTGGA AACATAATGA TGCAGAGAAG CAATTTTAAA 

1141 GGCCCTAGAA GAATTGTTAA ATGTTTTAAC TGTGGCAAGG AAGGGCACAT AGCCAGAAAT 

1201 TGCAGAGCCC CTAGGAAAAA AGGCTGTTGG AAATGTGGAA AAGAAGGACA CCAAATGAAA 

l*vx GACTGCACTG AGAGGCAGGC TAATTTTTTA GGGAAAATTT GGCCTTCCCA CAAGGGGAGG 

1321 CCAGGGAATT TCCTTCAGAA CAGACCAGAG CCAACAGCCC CACCAGCAGA GAGCTTCAGG 

1381 TTCGAAGAGA CAACCCCCGC TCCGAAACAG GAGCCGATAG AAAGGGAACC CTTAACTTCC 

1441 CTCAAATCAC TCTTTGGCAG CGACCCCTTG TCTCAATAA 
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SEQUENCE LISTING 

<110> university of cape Town 

south African Medical Research Council 

<120> a method for the production of hiv-1 Gag virus-Like Particles 

<130> PA132610/PCT 

<140> PCT/IB03/ 
<141> 2003-12-04 

<160> 4 

<170> Patentln version 3.1 

<210> 1 
<211> 1549 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 1 

gaattcatgg gtgcgagagc gtcaatatta agaggggaaa aattagataa atgggaaaag 60 

attaggttaa ggccaggggg aaagaaacat tatatgttaa aacacatagt atgggcgagc 120 

agggagctgg aaagatttgc acttaaccct ggccttttag aaacatcaga aggatgtaaa 180 

caaataatga aacagctaca accagctctc cagacaggaa cagaggaact taaatcatta 240 

tacaacacag tagcaactct ctattgtgta catgaaaaga tagaagtacg agacaccaag 300 

gaagccttag ataagataga ggaagaacaa aacaaatgtc agcaaaaaac gcagcaggca 360 

aaagcggctg acgggaaagt cagtcaaaat tatcctatag tgcagaatct ccaagggcaa 420 

atggtacatc aagccatatc acctagaacc ttgaatgcat gggtaaaagt aatagaagaa 480 

aaggctttta gcccagaggt aatacccatg tttacagcat tatcagaagg agccacccca 540 

caagatttaa acaccatgtt aaatacagtg gggggacacc aagcagccat gcaaatgtta 600 

aaagatacta ttaatgaaga ggctgcagaa tgggatagat tacatccagt ccatgcgggg 660 

cctattgcac caggccagat gagagaacca aggggaagtg acatagcagg aactactagt 720 

acccttcagg aacaaatagc atggatgaca agtaacccac ctattccagt gggagacatc 780 

tataaaagat ggataattct ggggttaaat aaaatagtga gaatgtatag cccggtcagc 840 

attttggaca taagacaagg gccaaaggaa ccctttcgag actatgtaga tcggttcttt 900 

aaaactttaa gagctgaaca agctacacaa gaagtaaaaa attggatgac agacaccttg 960 
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ttagtccaaa atgcgaaccc agattgtaag accattttga gagcattagg accaggggct 1020 

acattagaag aaatgatgac agcatgtcaa ggggtgggag gacctggcca caaagcaaga 1080 

gtattggctg aggcaatgag tcaaacaaac agtggaaaca taatgatgca gagaagcaat 1140 

tttaaaggcc ctagaagaat tgttaaatgt tttaactgtg gcaaggaagg gcacatagcc 1200 

agaaattgca gagcccctag gaaaaaaggc tgttggaaat gtggaaaaga aggacaccaa 1260 

atgaaagact gcactgagag gcaggctaat tttttaggga aaatttggcc ttcccacaag 1320 

gggaggccag ggaatttcct tcagaacaga ccagagccaa cagccccacc agcagagagc 1380 

ttcaggttcg aagagacaac ccccgctccg aaacaggagc cgatagaaag ggaaccctta 1440 

acttccctca aatcactctt tggcagcgac cccttgtctc aataaaagta gggggccaga 1500 

caagggaggc tctcttagac acaggagcag atgatacagt attgtcgac 1549 

<210> 2 
<211> 1479 
<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 2 

atgggtgcga gagcgtcaat attaagaggg gaaaaattag ataaatggga aaagattagg 60 

ttaaggccag ggggaaagaa acattatatg ttaaaacaca tagtatgggc gagcagggag 120 

ctggaaagat ttgcacttaa ccctggcctt ttagaaacat cagaaggatg taaacaaata 180 

atgaaacagc tacaaccagc tctccagaca ggaacagagg aacttaaatc attatacaac 240 

acagtagcaa ctctctattg tgtacatgaa aagatagaag tacgagacac caaggaagcc 300 

ttagataaga tagaggaaga acaaaacaaa tgtcagcaaa aaacgcagca ggcaaaagcg 360 

gctgacggga aagtcagtca aaattatcct atagtgcaga atctccaagg gcaaatggta 420 

catcaagcca tatcacctag aaccttgaat gcatgggtaa aagtaataga agaaaaggct 480 

tttagcccag aggtaatacc catgtttaca gcattatcag aaggagccac cccacaagat 540 

ttaaacacca tgttaaatac agtgggggga caccaagcag ccatgcaaat gttaaaagat 600 

actattaatg aagaggctgc agaatgggat agattacatc cagtccatgc ggggcctatt 660 

gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 720 

caggaacaaa tagcatggat gacaagtaac ccacctattc cagtgggaga catctataaa 780 

agatggataa ttctggggtt aaataaaata gtgagaatgt atagcccggt cagcattttg 840 

gacataagac aagggccaaa ggaacccttt cgagactatg tagatcggtt ctttaaaact 900 

ttaagagctg aacaagctac acaagaagta aaaaattgga tgacagacac cttgttagtc 960 

caaaatgcga acccagattg taagaccatt ttgagagcat taggaccagg ggctacatta 1020 

gaagaaatga tgacagcatg tcaaggggtg ggaggacctg gccacaaagc aagagtattg 1080 

gctgaggcaa tgagtcaaac aaacagtgga aacataatga tgcagagaag caattttaaa 1140 

ggccctagaa gaattgttaa atgttttaac tgtggcaagg aagggcacat agccagaaat 1200 

tgcagagccc ctaggaaaaa aggctgttgg aaatgtggaa aagaaggaca ccaaatgaaa 1260 

gactgcactg agaggcaggc taatttttta gggaaaattt ggccttccca caaggggagg 1320 
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ccagggaatt tccttcagaa cagaccagag ccaacagccc caccagcaga gagcttcagg 1380 
ttcgaagaga caacccccgc tccgaaacag gagccgatag aaagggaacc cttaacttcc 1440 
ctcaaatcac tctttggcag cgaccccttg tctcaataa 1479 

<210> 3 

<211> 513 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 3 

Glu Phe Met Gly Ala Arg Ala ser lie Leu Arg Gly Glu Lys Leu asd 
1 5 10 15 

Lys Trp Glu Lys lie Arg Leu Arg pro Gly Gly Lys Lys His Tyr Met 
20 25 30 

Leu Lys His He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Leu 
33 40 45 

Asn pro Gly Leu Leu Glu Thr ser Glu Gly cys L^s Gin He Met Lys 

Gin Leu Gin Pro Ala Leu Gin Thr Gly Thr Glu Glu Leu Lys ser Leu 
^ 70 75 80 

Tyr Asn Thr val Ala Thr Leu Tyr cys val His Glu Lys He Glu Val 
85 90 95 

Arg Asp Thr Lys Glu Ala Leu Asp Lgs lie Glu Glu Glu Gin Asn Lys 

cys Gin Gin Lys Thr Gin Gin Ala Lys Ala Ala Asp Gly Lys val ser 
" 120 125 

Gin Asn Tyr Pro He Val Gin Asn Leu Gin Gly Gin Met val His Gin 
XJU 135 140 

Ala He ser Pro Arg Thr Leu Asn Ala Trp val Lys val lie Glu Glu 
J/ " 150 155 160 

Lys Ala Phe ser Pro Glu val lie pro Met Phe Thr Ala Leu ser Glu 
ivS 170 175 

Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr val Gly Glv 
"O 185 190 

His Gin Ala Ala Met Gin Met Leu Lys Asp Thr He Asn Glu Glu Ala 
x ^ 200 205 

Ala Glu Trp Asp Arg Leu Hi| Pro Val His Ala Gly Pro He Ala Pro 
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Gly Gin Met Arg Glu Pro Arg Gly ser Asp lie Ala Gly Thr Thr Ser 
" 5 230 235 240 

Thr Leu Gin Glu Gin lie Ala Trp Met Thr ser Asn Pro Pro lie Pro 
245 250 255 

val Gly Asp lie Tyr Lys Arg Trp lie lie Leu Gly Leu Asn Lys lie 
260 265 270 

val Arg Met Tyr ser Pro Val Ser lie Leu Asp He Arg Gin Gly Pro 
275 280 285 

Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg phe Phe Lys Thr Leu Arg 
290 295 300 

Ala Glu Gin Ala Thr Gin Glu Val Lys Asn Trp Met Thr Asp Thr Leu 
3Ub 310 315 320 

Leu val Gin Asn Ala Asn pro Asp cys Lys Thr lie Leu Arg AlaLeu 
325 330 335 

Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly val 
340 345 350 

Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met ser Gin 

3:>:> 360 365 

Thr Asn ser Gly Asn lie Met Met Gin Arg Ser Asn Phe Lys Gly Pro 
370 375 380 

Arg Arg He val Lys Cys Phe Asn Cys Gly Lys Glu Gly His lie Ala 
J85 390 395 400 

Arg Asn cys Arg Ala Pro Arg Lys Lys Gly cys Trp Lys cys Gly Lys 
405 410 415 

Glu Gly His Gin Met Lys Asp cys Thr Glu Arg Gin Ala Asn Phe Leu 
420 425 430 

-Gly Lys lie Trp pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin 
435 440 445 

Asn Arg Pro Glu Pro Thr Ala Pro pro Ala Glu Ser Phe Arg Phe Glu 
450 455 460 

mk ^ Pro Ala ?™ Lys Gln Glu Pro Ile Glu Ar 9 Glu Pro Leu 
* D:> 470 475 480 

Thr ser Leu Lys Ser Leu Phe Gly ser Asp Pro Leu ser Gin Lys Gly 
485 490 495 

Ala Arg Gin Gly Arg Leu Ser Thr Gin Glu Gin Met lie Gin Tyr cys 
500 505 510 

Arg 
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<210> 4 

<211> 492 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 4 

Met Gly Ala Arg Ala Ser He Leu Arg Gly Glu Lys Leu Asp Lys Trp 
A 5 10 15 

Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys His Tyr Met Leu Lys 
20 25 30 

His He val Trp Ala ser Arg Glu Leu Glu Arg Phe Ala Leu Asn Pro 
35 40 45 

Gly Leu Leu Glu Thr Ser Glu Gly cys Lys Gin He Met Lys Gin Leu 
-> u 55 60 

Gin Pro Ala Leu Gin Thr Gly Thr Glu Glu Leu Lys ser Leu Tyr Asn 
0b 70 75 80 

Thr val Ala Thr Leu Tyr cys Val His Glu Lys lie Glu val Arq asd 
85 90 95 s 

Thr Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys cys Gin 
100 105 HO 

Gin Lys Thr Gin Gin Ala Lys Ala Ala Asp Gly Lys val Ser Gin Asn 
115 120 125 

Tyr Pro He val Gin Asn Leu Gin Gly Gin Met val His Gin Ala He 
i:3U 135 140 

ser Pro Arg Thr Leu Asn Ala Trp val Lys val He Glu Glu Lys Ala 
^ 150 155 160 

Phe ser Pro Glu Val He Pro Met Phe Thr Ala Leu ser Glu Gly Ala 
165 170 175 

Thr pro Gin Asp Leu Asn Thr Met Leu Asn Thr val Gly Gl^ His Gin 

Ala Ala Met Gin Met Leu Lys Asp Thr lie Asn Glu Glu Ala Ala Glu 
195 200 205 

Trp Asg Arg Leu His Pro Val His Ala Gly Pro lie Ala Pro Gly Gin 

Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu 

230 235 240 

Gin Glu Gin lie Ala Trp Met Thr ser Asn Pro pro He Pro Val Gly 
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245 250 255 

Asp lie Tyr Lys Arg Trp He lie Leu Gly Leu Asn Lys He Val Arg 
260 265 270 

Met Tyr Ser Pro Val ser He Leu Asp lie Arg Gin Gly Pro Lys Glu 
275 280 285 

Pro Phe Arg Asp Tyr val Asp Arg Phe Phe Lys Thr Leu Arg Ala Glu 
290 295 300 

Gin Ala Thr Gin Glu val Lys Asn Trp Met Thr Asp Thr Leu Leu val 
305 310 315 320 

Gin Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Arg Ala Leu Gly Pro 
325 330 335 

Gly Ala Thr Leu Glu Glu Met Met Thr Ala cys Gin Gly val Gly Gly 
340 345 350 

Pro Gly His Lys Ala Arg val Leu Ala Glu Ala Met ser Gin Thr Asn 
355 360 365 

ser Gly Asn He Met Met Gin Arg ser Asn Phe Lys Gly pro Arg Arg 
370 375 380 

lie val Lys cys Phe Asn cys Gly Lys Glu Gly His lie Ala Arg Asn 
385 390 395 400 

cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys cys Gly Lys Glu Gly 
405 410 415 

His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Phe Leu Gly Lys 
420 425 430 

lie Trp Pro ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Asn Arg 
435 440 445 

Pro Glu Pro Thr Ala Pro Pro Ala Glu Ser Phe Arg Phe Glu Glu Thr 
450 455 460 

Thr Pro Ala Pro Lys Gin Glu Pro lie Glu Arg Glu Pro Leu Thr Ser 
465 470 475 480 

Leu Lys ser Leu Phe Gly Ser Asp Pro Leu Ser Gin 
485 490 
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